Application No.: 09/914,573 

Amendments to the Claims 

This listing of claims will replace all prior versions, and listings, of claims in the 
application. 
Listing of Claims: 

1. (Currently Amended) A document image processor comprising! 

image inputting means for preparing a document image imag e s by reading a paper 
document^[[J] 

region dividing means for dividing the document image into a plurality of regions; 

[[,]] and 

title-region extracting means for calculating first averages as an average of 
character size for characters in each region divided by the region dividing means, and 
then extracting title regions from the e ntir e respective regions according to the first 
averages, a r e gion av e rag e charact e r siz e e quival e nt to an av e rag e siz e of charact e rs that is 
calculat e d p e r r e gion divid e d by th e r e gion dividing m e ans, 

wherein the title-region extracting means further comprises: 

means for calculating a second average that is an average of character size for characters 
within all the regions; 

means for comparing the first averages and extracting criteria found by 
multiplying the second average by extracting parameters, the extracting parameters on a 
plurality of levels calculated based on a value found by dividing a maximum of the first 
averages by the second average; and 

means for extracting the regions with the first average larger than the extracting 
criteria, as the title region. 
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compar e e ach r e gion averag e charact e r siz e and an extracting criterion r e sp e ctiv e ly; th e 
e xtracting crit e rion that is a total av e rag e charact e r size multipli e d by an e xtracting param e t e r; 
th e total av e rag e charact e r size calculat e d as a value equival e nt to an av e rag e size of all 
charact e rs included in th e e ntir e r e gions: and e xtracts as a titl e r e gions with th e r e gion av e rag e 
charact e r siz e larg e r than th e e xtracting crit e rion. 

2. (Currently Amended) A document image processor according to claim 1, wherein the 
title-region extracting means calculates the first averages and the second average th e r e gion 
av e rag e charact e r siz e and th e total av e rag e charact e r siz e based on an average height of 
characters. 

3. (Currently Amended) A document image processor according to claim 1, wherein the 
title-region extracting means calculates the first averages and the second average th e r e gion 
av e rag e charact e r siz e and th e total av e rag e charact e r siz e based on an average width of 
characters. 

4. (Currently Amended) A document image processor according to claim 1 , wherein the 
title-region extracting means calculates the first averages and the second average th e r e gion 
av e rag e charact e r siz e and th e total av e rag e charact e r siz e based on an average area of 
characters. 

5. (Cancelled) 
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6. (Currently Amended) A document image processor according to claim 1 , wherein the 
means for extracting the regions as the title region further extracts each level attribute 
indicating the level corresponding to each extracted title region, titl e r e gion e xtracting m e ans 
calculat e s th e e xtracting crit e rions on a plurality of l e v e ls by using th e e xtracting 
param e ters on a plurality of l e v e ls and e xtracts e ach titl e r e gion corr e sponding to e ach 
l e v e l attribut e indicating th e l e v e l of th e e xtracting. 

7. (Cancelled) 

8. (Currently Amended) A document image processor according to claim 1, wherein the 
title-region extracting means adopts the trim av e rag e trimmed mean method for discarding a 
specific proportion of the minimum and the maximum values and then computing the 
means of the remaining values, in order to calculate the first averages and the second 
average of character size, calculating th e total av e rag e charact e r siz e and th e r e gion 
av e rag e charact e r siz e according to charact e rs e xcluding both charact e rs larg e r than th e 
sp e cific ratio and charact e rs small e r than th e sp e cific ratio. 

9. (Currently Amended) A document image processor according to claim 1 , which further 
comprising correcting means for correcting character strings of the extracted title regions. 

10. (Cancelled) 
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11. (Currently Amended) A document title extracting method for [[of]] a document image 
processor comprising: 

inputting and an image inputting step of preparing a document image imag e s by 
reading a paper document; 

a dividing step of dividing a plurality of regions from [[a]] the document image; 

a calculating step of calculating first averages as an average of character size for 
characters in each region a r e gion av e rage charact e r siz e e quival e nt to th e av e rag e siz e 
of charact e rs p e r r e gion ; and 

a title-region extracting step of extracting title regions r e gion from the eatife respective 
regions bas e d on th e r e gion av e rag e character siz e according to the first averages, and 

wherein the calculating step comprises a step for calculating a second average 
that is an average of character size in all the regions, 

the title-region extracting step comprises a step of comparing the first averages 
and extracting criteria found by multiplying the second average by extracting 
parameters, the extracting parameters on a plurality of levels calculated based on a value 
found by dividing a maximum of the first averages by the second average; and 

a step of extracting the regions with the first average more than the extracting 
criteria, as the title region. 

in which th e step of calculating compris e s calculating a total av e rag e charact e r siz e 
e quival e nt to th e av e rag e siz e of charact e rs in th e e ntir e r e gions, 

and furth e r comprising comparing th e r e gion av e rag e charact e r siz e and a e xtracting 
crit e rion that is th e total av e rag e charact e r siz e multipli e d by an e xtracting param e t e r; and 
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in which th e st e p of e xtracting titl e region compris e s e xtracting as a titl e region r e gions 
with the region av e rag e charact e r siz e larg e r than the extracting crit e rion. 

12. (Currently Amended) A document title extracting method for [[of]] a document image 
processor according to claim 1 1, in which the st e p of calculating step comprises a step of 
calculating the first averages and the second average r e gion av e rag e charact e r siz e and th e 
total av e rag e charact e r siz e based on an average height of characters. 

13. (Currently Amended) A document title extracting method for [[of]] a document image 
processor according to claim 1 1, in which the s t e p of calculating step comprises a step of 
calculating the first averages and the second average r e gion av e rag e charact e r siz e and th e 
total av e rag e charact e r siz e based on an average width of characters. 

14. (Currently Amended) A document title extracting method for [[of]] a document image 
processor according to claim 11, in which the siep-^f calculating step comprises a step of 
calculating the first averages and the second average r e gion av e rag e charact e r size and th e 
total av e rag e charact e r siz e based on an average area of characters. 

15. (Cancelled) 

16. (Currently Amended) A document title extracting method for a document image 
processor according to claim 1 1 claim 14 , in which the step of extracting the regions as the 
title region further extracts each level attribute indicating the level corresponding to each 
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extracted title region, st e p of e xtracting titl e s compris e s calculating th e e xtracting 
crit e rions on a plurality of l e v e ls by using th e extracting param e t e rs on a plurality of 
lev e ls and extracting e ach titl e r e gion corr e sponding to e ach l e v e l attribut e indicating 
th e l e v e l of th e extracting. 

17. (Cancelled) 

18. (Currently Amended) A document title extracting method for [[of]] a document image 
processor according to claim 1 1, in which the st e p of extracting titl e title-region extracting 
step comprises a step of calculating the first averages and the second average total averag e 
charact e r siz e and th e r e gion av e rag e charact e r siz e according to the trim av e rag e 
trimmed mean method for discarding a specific proportion of the minimum and the 
maximum values and then computing the means of the remaining values, that calculat e s 
th e av e rag e of charact e rs e xcluding both th e characters larg e r than th e sp e cific ratio and 
th e charact e rs small e r than th e sp e cific ratio. 

19. (Original) A document title extracting method of a document image processor according to 
claim 11, further comprising the step of: 

correcting character strings of the extracted title regions. 

20. (Cancelled) 
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21 . (Currently Amended) A r e cording computer readable medium storing a program 
for performing the steps of for r e cording programs comprising : 

dividing a document image imag e s prepared by reading a paper document into a 
plurality of regions; 

calculating first averages as an average of character size for characters within 
each region and a second average that is an average of character size in all the regions; 

comparing the first averages and extracting criteria found by multiplying the 
second average by extracting parameters, the extracting parameters on a plurality of 
levels calculated based on a value found by dividing a maximum of the first averages by the 
second average; and 

extracting the regions with the first average more than the extracting criteria, as 
the title region. 

calculating p e r r e gion a r e gion av e rag e charact e r siz e e quival e nt to an av e rag e 
siz e of charact e rs in a r e gion and a total av e rag e charact e r siz e e quival e nt to an av e rag e 
siz e of charact e rs in th e e ntir e r e gions; 

comparing e ach r e gion av e rag e character siz e and e xtracting crit e rion that is th e 
total av e rag e charact e r siz e multipli e d by th e e xtracting param e t e r; and 

e xtracting r e gions with th e r e gion av e rag e charact e r siz e larg e r than the 
e xtracting crit e rion as a titl e r e gion. 

22 - 29. (Cancelled) 

30. (New) A document image processor comprising: 
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image inputting means for preparing a document image by reading a paper; 
region dividing means for dividing the document image into a plurality of regions; and 
title-region extracting means for calculating first averages as an average of 
character size for characters within each region divided by the region dividing means, 
and then extracting title regions from the respective regions according to the first averages, 
and 

wherein the title-region extracting means farther comprises: 

means for calculating a second average that is an average value of character size for 
characters within all the regions; 

means for comparing the first averages and extracting criteria found by 
multiplying the second average by extracting parameters; and 

means for extracting the regions with the first average more than the extracting 
criteria, as the title region, 

wherein the first averages and the second average of character size are calculated based 
on characters remaining after discarding a specific proportion of the minimum and the 
maximum values of the character size. 

31. (New) A document image processor according to claim 30, wherein the title-region 
extracting means calculates the first averages and the second average based on an average 
height of characters. 



10 



Application No.: 09/914,573 

32. (New) A document image processor according to claim 30, wherein the title-region 
extracting means calculates the first averages and the second average based on an average 
width of characters. 

33. (New) A document image processor according to claim 30, wherein the title-region 
extracting means calculates the first averages and the second average based on an average 
area of characters. 

34. (New) A document image processor according to claim 30, further comprising a 
correcting means for correcting character strings of the extracted title regions. 

35. (New) A document title extracting method for a document image processor 
comprising: 

an image inputting step of preparing a document image by reading a paper; 

a dividing step of dividing a plurality of regions from the document image; 

a calculating step of calculating first averages as an average of character size for 
characters within each region; and 

a title-region extracting step of extracting title regions from the respective regions 
according to the first averages, and 

wherein the calculating step comprise a step for calculating a second average that 
is an average of character size in all the regions, 

the title-region extracting step comprises a step of comparing the first averages 
and extracting criteria found by multiplying the second average by extracting 
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parameters, the extracting parameters on a plurality of levels calculated based on a value 
found by dividing a maximum of the first averages by the second average; and a step of 
extracting the regions with the first average larger than the extracting criteria, as the 
title region, and 

the first averages and the second average are calculated according to the trimmed 
mean method for discarding a specific proportion of the minimum and the maximum 
values and then computing the means of the remaining values. 

36. (New) A document title extracting method for a document image processor according 
to claim 35, in which the calculating step comprises a step of calculating the first averages 
and the second average based on an average height of characters. 

37. (New) A document title extracting method for a document image processor according 
to claim 35, in which the calculating step comprises a step of calculating the first averages 
and the second average based on an average width of characters. 

38. (New) A document title extracting method for a document image processor according 
to claim 35, in which the calculating step comprises a step of calculating the first averages 
and the second average based on an average area of characters. 

39. (New) A document title extracting method of a document image processor according to 
claim 35, further comprising the step of: 

correcting character strings of the extracted title regions. 
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40. (New) A document title extracting method of a document image processor according to 
claim 35, wherein the characters of which character size are lower than the specific portion are 
punctuation marks. 

41. (New) A document image processor according to claim 8, wherein the characters of 
which character size are lower than the specific portion are punctuation marks. 
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